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Abstract — Recently a new optimal control modification has 
been introduced that can achieve robust adaptation with a large 
adaptive gain without incurring high-frequency oscillations 
as with the standard model-reference adaptive control. This 
modification is based on an optimal control formulation to 
minimize the ..JA norm of the tracking error. The optimal 
control modification adaptive law results in a stable adaptation 
in the presence of a large adaptive gain. This study examines 
the optimal control modification adaptive law in the context 
of a system with a time scale separation resulting from a fast 
plant with a slow actuator. A singular perturbation analysis 
is performed to derive a modification to the adaptive law by 
transforming the original system into a reduced-order system 
in slow time. A model matching conditions in the transformed 
time coordinate results in an increase in the actuator command 
that effectively compensate for the slow actuator dynamics. 
Simulations demonstrate effectiveness of the method. 


in the singularly perturbed system. The singular perturbation 
approach transforms the original system into a reduced-order 
system in slow time. A model matching condition is applied 
to the reduced-order system and the reference model in the 
transformed slow time coordinate that increases the actuator 
command to accommodate the slow actuator dynamics. The 
resulting control signal can then track the reference model 
better than if the actuator command is not modified. 


II. Singularly Perturbed Systems with Slow 
Actuators 


Given a nonlinear plant as 


x = Ax + B 


+ ©* T <I>(x)+w(f) 


( 1 ) 


I. Introduction 

In the conventional MRAC framework, the tracking error 
is generally inversely proportional to the magnitude of the 
adaptive gain. However, a large adaptive gain can lead to 
high-frequency oscillations which can excite unmodeled dy- 
namics that could adversely affect the stability of an MRAC 
law [1], Various modifications were developed to increase 
robustness of MRAC by adding damping to the adaptive 
law to reduce high-frequency oscillations. Two well-known 
modifications in adaptive control are the cr-modification [2] 
and £i- modification [3], These modifications have been used 
extensively in adaptive control. Recently, a new modification 
has been introduced that is based on an optimal control 
formulation to minimize the J^Vnorm of the tracking error 
[4], The optimality condition results in a damping term in 
the adaptive law proportional to the persistent excitation. The 
optimal control modification has been shown to be able to 
achieve fast adaptation with a large adaptive gain without 
compromising stability robustness while preserving tracking 
performance. This study extends the development of the 
optimal control modification adaptive law to the case when 
there exists a time-scale separation between a fast plant and a 
slow actuator which prevents the plant to follow a reference 
model even in the presence of adaptive control. A singular 
perturbation approach is used to separate the time scales of 
the plant and actuators and then modify the optimal control 
modification adaptive law to account for the slow actuator 
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where x(t) : [0,°°j — ■> R” is a state vector, u (f) : [0,°°) — * R” is 
a control vector, A £ R" x " and B £ R" x " are known matrices 
such that the pair (A.B) is controllable and furthermore A 
is Hurwitz and B is invertible, 0* £ W >/n is an unknown 
constant weight matrix, <f> (x) : M" — > R p is a known bounded 
basis function and is at least piecewise smooth in x, and 
w(t) : [0,°°) — > R" is a small unknown bounded disturbance 
with || w (f ) || < wo and w £ for all t. 

The controller u(t) is subject to linear dynamics 

u = £A(u — u c ) (2) 

where u c (t) : [0,°°) — > M" is an actuator command vector, 
A £ R” x " is a known Hurwitz matrix, and £ is a positive 
constant and is assumed to be known by estimation. 

The objective is to design the controller u (t) that enables 
the plant to follow a reference model 

Xm — A n/Xfn T B tn r (3) 

where A m £ R' IX " is Hurwitz and known, B,„ £ R” xm is also 
known, and r(t) : [0,°°) — » W’ £ Jfoc is a bounded command 
vector with r £ 2zf 0 „. 

Consider the case when £ C 1 is a small parameter 
and £ ||A|| <C ||A||. Then x(t) is a fast state and a(t) is a 
slow control. To decouple the fast and slow variables, the 
singular perturbation method is invoked using a slow time 
transformation 

X = St (4) 

where x is a slow time variable. 



Then, the plant and actuator models are transformed into 
a singularly perturbed system as 


dx 

£ — = Ax + B 
d X 


+ ©* 1 4> (x) + w ( t ) 


(5) 


— =A(u-u c ) (6) 

d X 

The Tikhonov’s theorem can be used to approximate the 
solution of the singularly perturbed system with the solution 
of a “reduced-order” system by setting e = 0 [5], Then, 
x(u,w,£) is on a fast manifold. Thus, the reduced-order 
system is given by 


B ^ Ax o + mo + ©* T< t > (xp) + w ^ ^ 

= u 0 + w(^j+f(x 0 ) = 0 (7) 


duo . 

—— = A (m 0 -m c ) 
dx 


( 8 ) 


where xo and uq are the “outer” solution of the singularly 
perturbed system. 

The “inner” or “boundary layer” solution for this system 
is obtained from 


Xj = Axj + B I Uj + ©* T< f > (x; ) + w ( t ) 
M; — u c = 0 

The solution is then expressed as 

X (t ) = xo (t) + Xi ( t ) - xmae (t) 


(9) 

( 10 ) 


where Xmae U) is a correction term by a matched asymptotic 
expansion method applied to both the inner and outer solu- 
tions [6], The outer solution is in fact the asymptotic solution 
of the original system as t — » 

The algebraic solution of Eq. (7) can be expressed in 
general as 

X 0 (mo,W,£) =g(n 0 + tvQ)) = - f~ X ( M 0 + w(^)) (11) 
assuming f~ l exists. 

Differentiating Eq. (11) with respect to the slow time 
variable and then substituting the actuator model into the 
result yield 


dx o 
dx 


dg_j 

duo 


—B 1 Axq — ©* T< I > (xo) — w — u c 


From Eq. (7) 

dg _ dg_ 
duo dw 


fl- 1 A + ©*W(x ( 0 ) 


dg dw 

dw dx 

( 12 ) 

(13) 


Consider asymptotic solution of the singularly perturbed 
system. Then, in slow time, the reference model is expressed 

as 

a ^ = -{A m x m +B m r) (14) 

dx e 

Note that since dg/duo contains the uncertainty, the 
control design is quite complicated. In order to simplified 


the solution, the uncertainty term is assumed to be small. 
That is 


©* T $ (x) 


< ||fl _1 A 


(15) 


Then, using the matrix inversion lemma 


fl _1 A + 0* T <l>'(x) 


-l 


= A~ l B 


-A~ X B 


(©*V(x)) 1 


A~ l B 

A~ l B (16) 


« 1^7 — A _1 B©* t T> (x)J 

The following choice for the actuator command is made 
u c K x x K r r u a d (17) 

where 

(18) 


Kr = A^'ZU’A-A,,, — B~ l A 


K r = A~ 1 B~ l A-B n 


(19) 


Using the result of the matrix inversion lemma, the closed- 
loop singularly perturbed system now becomes 


dx 

dx 


I -A-' (x) -( A m x+B m r ) 


J £ 






■££ <*» 


Then, the adaptive signal u ac i can be designed to keep 
the following expression small by a judicious choice of a 
new basis function <f>i (x,r) : R" x R'" — > R 9 that spans the 
unknown constant parameter space ©j £ R 9Xn such that 


-A~ l B® 


*jd^ (x) 1 


dx 


{A m x H- B m r ) 




Uad-®* T ^{x)-w[^j 


dg 


= —A~ l BA©l «*>, (x, r ) + <p (x,r) - ^Aw ( J) + ^ ^ 


dg dw 

dw dx 

dg dw 

dw dx 

( 21 ) 


where ©i = ©i — ©J, and <p(x,r) is an approximation error 
which is to be kept small by a suitable choice of basis 
functions. 

Solving for u a d yields 


Uad = — A - 1 7 + ©* t T>' {x)A~ x B ©* T T>'(x)-(A,„x + B,„r) 
-l „ 


+ ©* 1 <I>(x)+A- 1 ( ^ ) I — A^BA©. 1 <J>i (x, r) + (p (x, r) 

1 ou 1 


( 22 ) 


From the assumption in Eq. (15) and neglecting the term 
A = ©* <f> (x) A “ 1 B©* T <t> (x), then one possible choice for 
the new basis function is 

< &i(x,r)= [ <5(x) ^ (x)x <t> (x)r ] T (23) 

Alternatively, the universal approximation theorem for 
neural networks can be used to approximate the uncertainty 



with a suitable choice of basis functions such as radial basis 
functions or sigmoidal basis functions [7], 

The closed-loop plant model in slow time is expressed as 

— = - (A m x + B m r) (x) B\8 (x, -) (24) 

where B\ = eA~ 1 BA and S (x, |) = 

Since A m is Hurwitz and if is bounded, then the 
Tikhonov’s theorem guarantees that the reduced solution 
with e > 0 converge to the solution of the original system 
with e = 0 as e — > 0. 


This results in the following equations obtained by a 
method of separation of variables 

dP 1 / T \ 1 

- + -(PA, + AIP) + 1 Q = 0 (31) 

is+i(Aj s+ ™,)=o <32) 

1 / t d(®T<S>) 1 

— PB l ©t T <b-<S) — QA = 0 (33) 

e V 1 dx e 

For an infinite time -horizon problem when x / — + °°, then 
P(x) ->P(0) and S(x) ->5(0) for all t G [0,°°). So, both P 
and 5 tend to their constant solutions where 


III. Optimal Control Modification Adaptive Law 


PA m +Aj n P = -Q (34) 


The tracking error equation in slow time is obtained as 

©7^1 Cm) + <5 (x, ^ 


de dx m dx 1 , 1 

c/t dx dx e e 


(25) 

We are interested in seeking an update law for © that 
minimizes the following cost function in slow time 


5 = -a;Jpb x (35) 

Without loss of generality, a weighting constant v > 0 G K. 
is introduced to allow for adjustments of the modification 
term in the adaptive law. Then, v = 1 gives an optimal 
solution. Thus, the adjoint p becomes 


/= lim — [ f (e-A) T Q(e-A)dx (26) 
T /-~ 2e J o 

subject to Eq. (25) where A represents the unknown lower 
bound of the tracking error and Q = Q 1 > 0 G R'“". 

This optimal control problem can be formulated by the 
Pontryagin’s Maximum Principle. Defining a Hamiltonian 


(e, ©[<*>!) = ^( e -A) T 2( e - A) 

+ ~P^ {^m e + Bi©7 < J ) i AB\ 5^ (27) 

where p(x) : [0,<=°) — > K" is an adjoint variable, then the 
necessary condition is obtained as 

% = = ~ \ Q (e ~ A) - \A T m p (28) 

with the transversality condition p(xf) =0 since e(0) is 
known. 

The adaptive law which provides an optimal control solu- 
tion can be formulated as a gradient update law as 


d® i 
dx 


d® i 

dx 


rv//e, 


l 

e 


T^ip T Bi 


(29) 


where T = T t > 0 G W> XCI is an adaptive gain matrix. 

An “approximate” solution of p can be obtained using a 
“sweeping” method [8] by letting p = Pe + S®\ <i>i, where 
P = P T > 0 G R" x " and S G R" x ”. Substituting into the 
necessary condition yields 


[A m e + B\®\^\ + fii<5) + ^©7^1 
+ S V j T lJ =--Q(e-A)--Aj n [ y Pe + S®J<P l ) (30) 


p = Pe- vA m J PB\®J 4>, (36) 

Substituting Eq. (36) into the gradient-based adaptive law 
yields the adaptive law in slow time 

= -V (<& l e J PB l - v®!®] ® iBJ PA m l B^J (37) 

Converting to regular time by multiplying e through Eq. 
(37) results in the optimal control modification adaptive law 

01 = ~r(® l e T PB 1 -v<I> 1 <pJ®iBjPA- 1 B 1 ^ ( 38 ) 

A. Stability Proof 

Choose a Lyapunov candidate function 

V = e T Pe + trace ^©^r^ 1 ©^ (39) 

Evaluating dV /dx in slow time yields 

— = -e T (pAm+A^P^j e+^e r PB\ ^©|<I>i + <S^) 

- ^ trace (©7<I>ie T Pfii - (40) 

By the trace property trace (X J Y) = YX ' , then 

jy 12 22 

— = --e T Qe+ -e T PB x ©7 <J>, + -e r PB l 8 - -e J PB® 7 T>i 

dx e e e e 

+ -v<pJ® l BjPA m l B@J<P ] + -v<PJ®* l BjPA w l B®J<P , 

(41) 

PA m ] can be decomposed into a symmetric part 
M = \ (PA m ] +A m J P'j = — ^A~ J QA~ X < 0 and an anti- 
symmetric part N = 7 ( PAf n x —A~ T P). Then, PAf/ = M + 



N. By the property of a symmetric matrix, since M < 0, 
therefore PA ~ l < 0. Thus 

dV 1 T 2 T 

— = — e'Qe+ -e T PB\8 
dx e e 




+ -v4>J ®\B[ PA m l B\®{ <f>, (42) 

Using the property of an anti-symmetric matrix y 1 Ny = 0, 
dV /dx is then bounded by 

^<-^WG)lMl [lkll- 2 || J p« 1 ||ab] 




(a- t qa~ 1 


II b iII 2 II©i|| ll^tll 2 


[||© 1 ||-2||M ; - 1 ||©S] (43) 

where 8q = sup vg ^ T ||5|| in some compact domain Pd C R", 
and 0 q = || ©i || - 

Let B r be a compact set where 

2||FBi||5b 


B r =\ (e,©!) er'xr" : ||e|| <r = 


^ min ( Q ) 


|©l|| <K = 


2||m m 1 ||©s 

min {^A-m QAm 


(44) 


Then dV /dx < 0 in Br — B r where B, C Br = 
{e G R" : ||e|| <R} C S>. Let Bp be the smallest subset that 
encloses B r , then there exists /) > 0 where 


P = Knax (P) r 2 + Kax (r l ) K 2 


such that 


B r C Bp = { (e, ©i) erxl« x “:y < /3 } 


(45) 


(46) 


Let B a be the largest subset enclosed by Br, then since 
H| < R in Br, there exists a > 0 where 


^ min (P)\\e\\~ <V <l min (P) R 2 = a 


(47) 


such that 


B a = {(e,® l ) eR" xR« x " :U< a} c B R (48) 

Then for a solution to be uniformly bounded, the set 
containment is as follows: 

B r CBpCB a C Br (49) 

This implies 

(3 < a <=> Kax (P)r 2 + Kax (r —1 ) K 2 < X min (. P ) R 2 (50) 
Therefore 


R > 


I Kax (P) X 2 + Kax (r 1 ) K~ 
Xmin ( P ) 


= P 


(51) 


Then p is the ultimate bound of e ( t ) such that 


r<|H|<p<i? (52) 


Since dV /dx < 0 for all (e,©i) £ BR—B r , therefore V 
is a decreasing function of time outside of B r . Thus, if 
(e(0) ,©i (0)) € B a , then according to Theorem 5.1 of Ref. 
[9], the solution will eventually enters Bp after a finite 
time t = T (independent of (e(0) ,0(0) ,AB(0)) and a) and 
remain therein for all t > T. Therefore, e (t) is uniformly 
ultimately bounded with an ultimate bound p. 

Since e (t) and ©i (f) are bounded, the unknown lower 
bound of the tracking error A at t = tf — > °° is also bounded 
such that 


l|A|| < 


IHgill (p + Sp + vr ] ||a 

X min ( Q ) 


-T| 


(53) 


where ©| t T>i < p <E R and 


dt 


< 7] G R for all t. 


B. Example 

Consider the following simple scalar system 

x = ax+bu+ 9*x + w (t) (54) 

with actuator dynamics 

it = eX (m — up) (55) 

where a < 0, X < 0, e > 0, |eA| < |a|, and w(t) is a small 
disturbance signal. 

The reference model is 

Xm z a m \m + b m x (56) 

where a m < 0. 

The actuator command is designed as 

^ ( dim , \ ab m / \ /e r -j\ 

u ’ = b(a- l ) x+ M r - 0 * (v> <57) 

where «I> (x, r) = [ x r ] T . 

Note that if actuator dynamics are fast then the actuator 
command is 

di f dim b m 

u c =-\ l)x+—r—9x (58) 

b V a / b 

The optimal control modification update law for slow 
actuator system is 

0 = -r ( - v<M> t ©i 

V dl m J 

( bX -r b 2 X~\ 

= -er[T>e evT>T> ©i — ) (59) 

\ a a~a m J 

where b\ = and for fast actuator system is 

0 = -r (xeb - vx 2 e — ^ (60) 

V dl m J 

If a and X are nominally in the same order of magnitude, 
then we note that for the slow actuator system, the effective 


where p is the smallest value of R. 



adaptive gain is also reduced by e for a similar performance 
as that for the fast actuator. 

For the numerical example, a = —l, b - 1 , 0* = 0.1, 
A = — 1 , e = 0.1, a m = —5, b m = 1 , r{t) = sinf, w(t) = 
0.05 sin 1 Of, r = 2850, and v = 1. The responses due to 
the standard MRAC adaptive law with and without the 
slow actuator compensation by the singular perturbation are 
plotted in Fig. 1. The adaptive gain T was purposely selected 
high enough that the standard MRAC begins to exhibit 
instability due to slow actuator dynamics with the singular 
perturbation. The uncompensated response does not follow 
the reference model as expected. 

i , , , , 
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Fig. 1 - Response due to MRAC 

The responses due to the optimal control modification 
adaptive law with and without the slow actuator compen- 
sation by the singular perturbation are plotted in Fig. 2 
for the same adaptive gain T and V = 1. Without slow 
actuator compensation by the singular perturbation, the re- 
sponse cannot track the reference model. However, with 
slow actuator compensation by the singular perturbation, the 
optimal control modification produces a response that tracks 
the reference model very well. 
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Fig. 2 - Response due to Optimal Control Modification 
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IV. Flight Control Example 
Consider aircraft pitch attitude dynamics 

CLf,qSc 


mV 


t 2V n 

C '”a 

2V 


-C La qS mV - 


0 

yy 

CL q qSc 



a 


q 


C, 


m a 


2V 


a 

q 


^Lg, 

C, 


m s e 


(5, 


0 fT O 


where 4> = [ a q ] T and ©* T = [ 0 —0.1657 ] is a 

parametric uncertainty that represents an 80% reduction in 
the pitch damping coefficient C niq . 

A numerical model for a full-scale generic transport model 
(GTM) at Mach 0.8 and 30,000 ft is given by 


a 


' -0.7018 

0.9761 


a 

q 


-2.6923 

-0.7322 


c i 


-0.0573 

-3.5352 


(8 e - 0.1651 q) 


B 


The elevator output is prescribed by degraded actuator 
dynamics 

u = eA (u — u c ) (62) 

where A = 50 rad/sec and e = 0.01. 

A desired reference model of the pitch attitude is given by 


(63) 




0 

i 

Gm 


0 



. -«« 

— 2C (On _ 

Qm 

+ 

. . 


where £ = 0.85 and co„ = 1.5 rad/sec are chosen to give a 
desired handling characteristic. 

Let x = [ 0 q ] and q = Cx where C = [ 0 1 ] . 
A desired nominal controller is designed as u* = k* a a + 

C(A m —a22l) 

a 

2^(O n +a22 


K*x + k*r where k* a = -^ = -0.7616, K* = 


b 2 


b 2 


= [ 0.6365 0.5142 


by 

and k* = 


( ^f- = jyy = —0.6365. The closed-loop eigenvalues are 
—0.6582 and -1.2750 ±0.7902/. 

The actuator command without a compensation for the 
slow actuator dynamics is given by 


u c = k* a a + K*x + k* r r - © T T> 
© = r <S>ePB 


(64) 

(65) 


where e = x m — x. 

Applying the singular perturbation method to compensate 
for the slow actuator dynamics, the reduced-order system is 
obtained as 

(66) 


021 „ «22 

u = — - — a — - — x— 0 q 
bi b 2 


The compensated actuator command is then given by 
u c =k* a a + K x x + k,r — ® J, V>\ (a,x,r) 


(67) 


where <f>i ( a,x,r ) = [ a x r 1 and 


K r = 


^22^ / A 


= 


&2 \eA 

aiiCB, 


-I 


b 2 eX 


( 68 ) 

(69) 


©i = -r<J>, (e T B-v<I>7©iB ] r PA m 1 


Bi 


(70) 


where fij = 


b 2 eX 
a 22 


1 T 


If 2 = ql > 0, then B[ PA m 1 B \ = — ^4 and the adaptive 

law can also be written as 


0 i = -r( tf> i g 1 / j /j i + / //,2 f \ ^ 
24i< 


©1 


(71) 


Figure 3 shows the pitch attitude response to a pitch 
doublet reference signal due to the standard MRAC. An 
adaptive gain of T = 100 is used. Without compensation for 
the slow actuator dynamics, the pitch angle does not track 
the reference model. When the singular perturbation is used 
to compensate for the slow actuator dynamics, a high gain 
situation is encountered and the response produces a high 
frequency signal. This high gain is due to the factor of 1 /ek 
that appears in the control gains which effectively increases 
the actuator command to compensate for the slow actuator 
dynamics. 




Fig. 3 - Pitch Attitude Response due to MRAC 

Figure 4 shows the simulation results with the optimal 
control modification. The adaptive gain is kept the same and 
the tuning parameter v = 1 is used. Without compensation, 
the pitch angle tracks the reference model reasonably well, 
although there is a delay in the response of about 1 sec. 
When the singular perturbation is used to compensate for the 
slow actuator dynamics, the delay is reduced, but in turn, the 
signal exhibits a higher frequency content. 




Fig. 4 - Pitch Attitude Responsse due to Optimal Control 
Modification 

V. Conclusions 

This paper presents a singular perturbation approach in 
connection with an optimal control modification adaptive 
law for a time-scale separated system with slow actuator 
dynamics. The singular perturbation approach transforms the 
system into a reduced-order system in a slow time coordinate. 
The actuator command is obtained by the model matching 
condition in the slow time coordinate. The resulting actuator 
signal in effect is increased by the ratio of the norm of the 
plant’s transition matrix to the norm of the slow actuator’s 
transition matrix. The optimal control modification adaptive 
is derived and analyzed for stability using the Lyapunov 
method. Under fast adaptation when the adaptive gain is 
large, the analysis shows that the tracking error remains 
bounded and stable. Simulation results demonstrate the ef- 
fectiveness of the method. 
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